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SELECTIVE MEDIA STORAGE BASED ON USER PROFILES 

AND PREFERENCES 

TECHNICAL FIELD 

5 This invention relates generally to electronic data processing and more 

particularly, to selective media storage based on user profiles and preferences. 

BACKGROUND 

A number of different electronic devices have been developed to assist 
10 viewers in recording and viewing of video/audio programming. One such device 
that is increasing in demand is the digital video recorder that allows the user to store 
television programs for subsequent viewing, pause live television, rewind, etc. 

BRIEF DESCRIPTION OF THE DRAWINGS 

15 Embodiments of the invention may be best understood by referring to the 

following description and accompanying drawings which illustrate such 
embodiments. The numbering scheme for the Figures included herein are such that 
the leading number for a given reference number in a Figure is associated with the 
number of the Figure. For example, a system 100 can be located in Figure 1. 

20 However, reference numbers are the same for those elements that are the same 
across different Figures. In the drawings: 

Figure 1 illustrates a block diagram of a system configuration for selective 
media storage based on user profiles and preferences, according to one embodiment 
of the invention. 

25 Figure 2 illustrates a more detailed block diagram of parts of the system 

configuration of Figure 1, according to one embodiment of the invention. 

Figure 3 illustrates the different software and hardware layers for the parts 
of the system configuration of Figure 1, according to one embodiment of the 
invention. 
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Figure 4 illustrates a flow diagram for selective media storage based on user 
profiles and preferences, according to one embodiment of the invention. 

Figure 5 illustrates a flow diagram for selecting the sequences of media data 
based on user preferences/profile, according to one embodiment of the invention. 

5 

DETAILED DESCRIPTION 

Methods, apparatus and systems for selective media storage based on user 
profiles and preferences are described. In the following description, numerous 
specific details are set forth. However, it is understood that embodiments of the 

10 invention may be practiced without these specific details. In other instances, well- 
known circuits, structures and techniques have not been shown in detail in order not 
to obscure the understanding of this description. As used herein, the term "media" 
may include video, audio, metadata, etc. 

This detailed description is divided into three sections. In the first section, 

15 one embodiment of a system is presented. In the second section, embodiments of 
the hardware and operating environment are presented. In the third section, 
embodiments of operations for video storage based on user profiles and preferences 
are described. 

20 System Overview 

In this section, one embodiment of a system is presented. In one 
embodiment, the system illustrated herein may be a part of a set-top box, media 
center etc. In an embodiment, this system is within a personal video recorder 
(PVR). 

25 Figure 1 illustrates a block diagram of a system configuration for selective 

media storage based on user profiles and preferences, according to one embodiment 
of the invention. In particular, Figure 1 illustrates a system 100 that includes a 
receiver 102, a media asset management logic 104, a storage logic 106, a storage 
medium 108 and an I/O logic 124. As further described below, the storage medium 

30 108 includes a number of different databases. While the system 100 illustrates one 
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storage medium for these different databases, embodiments of the invention are not 
so limited, as such databases may be stored across a number of such mediums. 

The receiver 102 is coupled to the storage logic 106 and the media asset 
management logic 104. The media asset management logic 104 is also coupled 
5 back to the receiver 102. The storage logic 106 is coupled to the storage medium 
108. The media asset management logic 104 is coupled to the storage logic 108, the 
display 122, the I/O logic 124 and the storage medium 108. While the display 122 
may be a number of different types of displays, in one embodiment, the video 
display 122 is a cathode ray tube (CRT). In an embodiment, the display 122 is a 
10 plasma display. In one embodiment, the display 1 12 is a liquid crystal display 
(LCD). 

The receiver 102 is coupled to receive a signal, which, in one embodiment, 
is a Radio Frequency (RF) signal that includes a number of different channels of 
video/audio for display on the display 122. In an embodiment, this signal also 

1 5 includes metadata for an Electronic Programming Guide (EPG) that is not adapted 
to a given user of the system 100. For example, the data could include the 
cataloging information (e.g., source, creator and rights) and semantic information 
(e.g., who, when, what and where). 

As further described below, in an embodiment, the media asset management 

20 logic 104 selectively stores television programs and parts thereof based on the past 
viewing profile of the user of the system 100. In one embodiment, the media asset 
management logic 104 selectively stores television programs and parts thereof based 
on at least one cue regarding viewing preferences provided by the user of the system 
100. Such cues may include different characteristics that may be within frames of 

25 the video/audio. For example, the cues may be particular shapes, audio sequences, 
text within the video and/or within the close-captioning, etc. As further described 
below, in an embodiment, the media asset management logic 104 may also 
store/record a program, without commercials that are typically embedded therein. 
Additionally, the media asset management logic 104 may customize the 

30 Electronic Programming Guide (EPG) for a given viewer/user of the system 100. 
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The media asset management logic 104 registers the favorite channels and programs 
therein of the viewer/user based on a differentiation of channel surfing versus actual 
viewing of the programs by the user. To illustrate, assume that the user of the 
system 100 uses the EPG to select professional football for viewing on Monday 
5 nights on channel 38. Moreover, assume that the user of the system 100 uses the 
EPG to select the prime time news on channel 25 for viewing. Such selections are 
recorded in a profile database for the user. Accordingly, such selections are 
registered in a database in the system 100. The media asset management logic 104 
may use these registered selections to customize the EPG such that the viewer/user 
10 is presented with a shortened list of channels and/or programs for viewing within 
the EPG. 

Moreover, in an embodiment, the cues regarding viewing preferences may 
be inputted by the user through multimodal interfaces. For example, the user of the 
system 100 may select a video and/or audio sequence/clip from a program that the 

15 user is viewing. In another embodiment, the user of the system 100 may input a 
video and/or audio clip through other input devices. For example, the system 100 
may be coupled to a computer, wherein the user may input such clips. To illustrate, 
the user may only desire to view the scoring highlights from a soccer match. 
Therefore, the user may input a video clip of a professional soccer player scoring a 

20 goal. The media asset management logic 104 may then record all of the goals 

scored in a given soccer match. Accordingly, because the number of goals scored in 
a soccer match is typically limited, the storage space for such highlights is much 
less in comparison to the storage space for the entire soccer match. Examples of 
other types of input through multimodal interfaces may include a voice of an actor 

25 or sports announcer, a voice sequence of a phrase or name ("goal", "Jordan scores", 
etc.), different shapes or textures within the video, text from close captioning, text 
embedded with the video, etc. 

In an embodiment, the storage logic 106 receives and stores the incoming 
media data (video, audio and metadata) into a temporary work space within the 

30 media database 224. The media asset management logic 104 may subsequently 
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process this media data. Based on the processing, in an embodiment, the media 
asset management logic 104 may store only parts of such media data based on the 
past viewing profile of the user and/or the cues for the different preferences from the 
user. Accordingly, embodiments of the invention are able to process the incoming 
5 media data in near real time and select what programs and parts thereof are to be 
recorded that are specific to a given user. Embodiments of the invention, therefore, 
may record "interesting" (relative to the user) parts of the "right" (relative to the 
user) programs using cues provided by the user. 

10 Hardware and Operating Environment 

In this section, a hardware and operating environment are presented. In 
particular, this section illustrates a more detailed block diagram of one embodiment 
of parts of the system 100. 

Figure 2 illustrates a more detailed block diagram of parts of the system 

15 configuration of Figure 1, according to one embodiment of the invention. As 
shown, the receiver 102 includes a tuner 202, a transport demuxer 204 and a 
decoder 206. The storage logic 106 includes a time shift logic 208 and an encoder 
210. The media asset management logic 104 includes a media asset control logic 
214, a shape recognition logic 216, a voice recognition logic 218, a text recognition 

20 logic 220, a texture recognition logic 221 and a sequence composer logic 222. 

The storage medium 108 includes a media database 224, an EPG database 
226, a preference database 228, a profile database 230, a presentation quality 
database 232 and a terminal characteristics database 234. In an embodiment, the 
EPG database 226 is representative of at least two different EPG databases. The 

25 first EPG database stores the EPG exported by the service provider of the media 
data signal (e.g., the cable or satellite television service providers). The second 
EPG database stores EPGs that are specific to the users of the system 100 based on 
the selective media storage operations, which are further described below. 

The tuner 202 is coupled to receive a media data signal from the service 

30 provider. The tuner 202 is coupled to the transport demuxer 204. The transport 
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demuxer 204 is coupled to the decoder 206. The decoder 206 is coupled to the time 
shift logic 208. The encoder 210 is coupled to the time shift logic 208. The time 
shift logic 208 is coupled to the media database 224. 

The media asset control logic 214 is coupled to the tuner 202, the shape 
5 recognition logic 216, the voice recognition logic 218, the text recognition logic 
220, the texture recognition logic 221, the sequence composer logic 222, the time 
shift logic 208 and the I/O logic 124. The media asset control logic 214 is also 
coupled to the EPG database 112, the preference database 1 14, the profile database 
116, the presentation quality database 118 and the terminal characteristics database 
10 120. The sequence composer logic 222 is coupled to the encoder 210 and the EPG 
database 226. 

The time shift logic 208 is coupled to the media database 224. The media 
asset management logic 104 is coupled to the EPG database 226, the preference 
database 228, the profile database 230, the presentation quality database 232 and the 

15 terminal characteristics database 234. The presentation quality database 232 stores 
the configuration information regarding the quality of the video being stored and 
displayed on the display 122. Such configuration information may be configured by 
the user of the system 100 on a program-by-program basis. Accordingly, the media 
asset management logic 104 may use the presentation quality database 232 to 

20 determine the amount of data to be stored for a program. The terminal 

characteristics database 234 stores data related to the characteristics of the display 
122 (such as the size of the screen, number of pixels, number of lines, etc.). 
Therefore, the media asset management logic 104 may use these characteristics 
stored therein to determine how to configure the video for display on the display 

25 122. 

While different components of the system 100 illustrated in Figure 2 can be 
performed in different combinations of hardware and software, one embodiment of 
the partitioning of the different components of the system 100 into different 
software and hardware layers is now described. In particular, Figure 3 illustrates 
30 the different software and hardware layers for the parts of the system configuration 
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of Figure 1, according to one embodiment of the invention. Figure 3 illustrates a 
user interface layer 302, an application layer 304, a resource management layer 306 
and a hardware layer 308. 

The user interface layer 302 includes the I/O logic 124. As further described 
5 below, the I/O logic 124 receives input from the user for controlling and configuring 
the system 100. For example, the I/O logic 124 may receive different cues for 
preferences from the user via different multimodal interfaces. In one embodiment, 
an input source may be a remote control that has access to multimodal interfaces 
(e.g., voice, graphics, text, etc.) As shown, the I/O logic 124 is coupled to forward 

10 such input to the media asset control logic 214. 

The application layer 304 includes the media asset control logic 214, the 
shape recognition logic 218, the text recognition logic 220, the texture recognition 
logic 221 and the sequence composer logic 222. The resource management layer 
306 includes components therein (not shown) that manages the underlying hardware 

15 components. For example, if the hardware layer 308 includes multiple decoders 
206, the resource management layer 306 allocates decode operations across the 
different decoders 206 based on availability, execution time, etc. The hardware 
layer 308 includes the tuner 202, the transport demuxer 204, the decoder 206, the 
time shift logic 208 and the encoder 210. 

20 Embodiments of the invention are not limited to the layers and/or the 

location of components in the layers illustrated in Figure 3. For example, in another 
embodiment, the decoder 206 and/or the encoder 210 may be performed by software 
in the application layer 304. 

25 Selective Media Storage Operations Based on User Profiles/Preferences 

Embodiments of selective media storage operations based on user 
profiles/preferences are now described. In particular, embodiments of the 
operations of the system 100 are now described. Figure 4 illustrates a flow diagram 
for selective media storage based on user profiles and preferences, according to one 

30 embodiment of the invention. 
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In block 402, a media data signal is received into a device coupled to a 
display. With reference to the embodiment of Figure 2, the tuner 202 receives the 
media data signal. In an embodiment, the media signal that includes a number of 
different channels having a number of different programs for viewing. Control 
5 continues at block 404. 

In block 404, a past viewing profile for a user of the device and at least one 
cue regarding viewing preferences provided by the user is retrieved. With reference 
to the embodiment of Figure 2, the media asset control logic 214 retrieves the past 
viewing profile of the user of the system 100 from the profile database 230. The 

10 media asset control logic 214 generates the profile for a user of the system 100 

based on the viewing habits of the user. In particular, the media asset control logic 
214 monitors what the user of the system 100 is viewing on the display 122. For 
example, the user may be viewing an incoming program (independent of the 
recording operations of the system 100) through the tuner 202 and/or a different 

15 tuner (not shown). Accordingly, the media asset control logic 214 registers the 
metadata for such programs into the profile database 230. Additionally, the media 
asset control logic 214 may monitor the programs recorded and stored in the media 
database 224, in conjunction with and/or independent of the recording operations 
described herein. For example, the user of the system 100 may request the 

20 recording of a program that is independent of the video storage operations described 
herein. In an embodiment, the media asset control logic 214 differentiates surfing 
of the channels versus actual viewing of the program. In one such embodiment, the 
media asset control logic 214 makes this differentiation based on the length of time 
the viewer is viewing the program. For example, if the viewer watches more than 

25 20% of a given program (either consecutively and/or disjointly), the media asset 
control logic 214 registers this program. 

In an embodiment, the media asset control logic 214 also retrieves at least 
one cue regarding viewing preferences provided by the user of the system 1 00 from 
the preference database 228. As described above, the user of the system 100 may 

30 input viewing preferences into the system 100 through the I/O logic 124. In 
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particular, the user of the system 100 may input video sequences, clip art, audio 
sequences, text, etc. through a number of different multimodal interfaces. Control 
continues at block 406. 

In block 406, a program in the media data signal is selected based on the past 
5 viewing profile of the user. With reference to the embodiment of Figure 2, the 
media asset control logic 214 selects the program on a channel that is within the 
media data signal. In an embodiment, the tuner 202 receives the media data signal. 
In an embodiment, the tuner 202 converts the media data signal (that is received) 
into a program transport stream based on the channel that is currently selected for 

10 viewing. In one embodiment, the media asset control logic 214 controls the tuner 
202 to be tuned to a given channel. For example, based on the preferences of the 
user, the media asset control logic 214 determines that highlights of a soccer match 
on channel 28 are to be recorded. Therefore, the media asset control logic 214 
causes the tuner to tune to channel 28. 

15 In given situations, multiple programs across multiple channels (which are 

considered to be "favorites" for the user and part of the profile for the user) are 
being received within the media data signal for viewing at the same time. In one 
embodiment, the media asset control logic 214 prioritizes which of such programs 
are to be selected. The media asset control logic 214 may store a priority list for the 

20 different "favorite" programs based on user configuration, the relative viewing time 
of the different "favorite" programs, etc. For example, the viewer may have viewed 
100 different episodes of a given situational comedy, while only having viewed 74 
different professional soccer matches. Therefore, if a time conflict arises, the media 
asset control logic 214 selects the situational comedy. Moreover, while the system 

25 100 illustrates one tuner 202, embodiments of the invention are not so limited, as the 
system 100 may include a greater number of such tuners. Accordingly, the media 
asset control logic 214 may resolve time conflicts between multiple "favorite" 
programs based on different tuners tuning to the different programs for processing 
by the media asset control logic 214. Control continues at block 408. 
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In block 408, at least one sequence in the program is selected based on the at 
least one cue regarding viewing preferences provided by the user. With reference to 
the embodiment of Figure 2, the media asset management logic 104 selects this at 
least one sequence. In particular, the tuner 202 outputs the transport stream 
5 (described above) to the transport demuxer 204. The transport demuxer 204 de- 
multiplexes the transport stream into a video stream and an audio stream and 
extracts metadata for the program. In one embodiment, the transport demuxer 204 
de-multiplexes the single program stream based on a Program Association Table 
(PAT) and a Program Management Table (PMT) that are embedded in the stream. 

10 The transport demuxer 204 reads the PAT to locate the PMT. The transport 
demuxer 204 indexes into the PMT to locate the program identification for the 
program or parts thereof to be recorded. The transport demuxer 204 outputs the 
video stream, audio stream and metadata to the decoder 206. 

The decoder 206 decompresses the video, audio and metadata to generate 

15 video frames, audio frames and metadata. In an embodiment, the decoder 205 

marks the frames with a timeline annotation. For example, the first frame includes 
an annotation of one, the second frame includes an annotation of two, etc. The 
decoder 206 outputs these frames to the time shift logic 208. In an embodiment, the 
time shift logic 208 receives and stores these frames into a temporary workspace 

20 within the media database 224. Additionally, the time shift logic 208 transmits 

these video, audio and metatdata frames to the media asset control logic 214 within 
the media asset management logic 104. Components in the media asset 
management logic 104 selects the at least one sequence in the program. A more 
detailed description of this selection operation is described in more detail below in 

25 conjunction with the flow diagram 500 of Figure 5. Control continues at block 410. 
In block 410, the selected sequences are stored. With reference to the 
embodiment of Figure 2, the media asset control logic 214 stores the selected 
sequences into the media database 224. Moreover, in an embodiment, media asset 
control logic 214 updates the three index tables (one for normal play, one for fast 
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forward and one for fast reverse) in reference to the selected sequences. Control 
continues at block 412. 

In block 412, the Electronic Programming Guide (EPG) specific to the user 
is updated with the selected sequences. With reference to the embodiment of Figure 
5 2, the sequence composer logic 222 updates the EPG specific to the user with the 
selected sequences. In an embodiment, the EPG for the user is stored in the EPG 
database 226. The EPG is a guide that may be displayed to the user on the display 
122 that includes the different programs and sequences within programs that are 
stored in the system 100 for viewing by the user. Accordingly, when selected 

1 0 sequences are stored that are specific to the user based on their profile and 

preferences, the EPG for the user is updated with such sequences. For example, if 
the media asset management logic 104 stores scoring highlights from a soccer match 
for the user, the EPG is updated to reflect the storage of these highlights. 

In an embodiment, the sequence composer logic 222 generates a metadata 

15 table that is stored in the media database 224. The metadata table includes metadata 
related to the frames within the selected sequences. Such metadata includes 
cataloging information (source, creator, rights), semantic information (who, when, 
what, where) and generated structural information (e.g., motion characteristics or 
face signature, caption keywords, voice signature, etc.). In an embodiment, the EPG 

20 for this user references this metadata table. 

A more detailed description of the selection of sequences within a program 
based viewing preferences of a user is now described. In particular, Figure 5 
illustrates a flow diagram for selecting the sequences of media data based on user 
preferences/profile, according to one embodiment of the invention. The operations 

25 of the flow diagram 500 are for a given number of frames that are stored in a 

temporary workspace. In one embodiment, a temporary workspace for a number of 
frames is allocated within the media database 224. Therefore, the operations of the 
flow diagram 500 may be repeatedly performed on the frames for a selected 
program to which the tuner 202 is tuned. For example, this temporary workspace 

30 may be 10 minutes of media data frames. However, embodiments of the invention 
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are not so limited. For example, in another embodiment, the operations of the flow 
diagram 500 may be performed while the frames are received from the time shift 
logic 208. Additionally, in an embodiment, the operations of the flow diagram 500 
are repeated until the frames of the selected program have been processed. 

The operations of the flow diagram 500 commence in blocks 506, 507, 508 
and 509. As shown, in an embodiment, the operations within the blocks 506, 507, 
508 and 509 are performed in parallel at least in part by different logic within the 
system 100. Moreover, in an embodiment, the operations of the flow diagram 500 
remain at point 511 until each of the operations in blocks 506, 507, 508 and 509 are 
complete. However, embodiments of the invention are not so limited. For example, 
in one embodiment, a same logic may serially perform each of the different 
operations in blocks 506, 507, 508 and 509. Additionally, in an embodiment, the 
operations in blocks 506, 507, 508, 509, 512, 514, 516, 518, 520 and 522 are for a 
given frame. 

In block 506, a voice recognition match score is generated. With reference 
to the embodiment of Figure 2, the voice recognition logic 218 generates the voice 
recognition score. In particular, the media asset control logic 214 transmits voice- 
related preferences for the user along with the frame of audio. The voice 
recognition logic 218 generates this score based on how well the voice-related 
preferences match the audio in the frame. For example, there may be 50 different 
voice-related preferences, which are each compared to the audio in the frame. To 
illustrate, these voice-related preferences may be different audio clips that the user 
has inputted as a preference. A first audio clip could be the voice of a sport 
announcer saying "Jordan scores." A second audio clip could be the voice of a 
favorite actor. In an embodiment, the voice recognition logic 218 performs a 
comparison of a preference to the audio in the frame based on the catalog (source, 
creator, rights) and the semantic (who, when, what, where) information associated 
with the preference and the frame. For example, if a given frame is related to a 
basketball game, only those preferences that are from and/or related to a basketball 
game are compared to the audio in the frame. In an embodiment, the voice 
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recognition logic 218 generates an eight bit (0-255) normalized component match 
score. Accordingly, the voice recognition logic 218 generates a relative high match 
score if likelihood of a match between one of the voice-related preferences and the 
audio in the frame is high. The voice recognition logic 218 outputs the voice 
5 recognition match score to the media asset control logic 214. Control continues at 
block 512. 

In block 507, a shape recognition match score is generated. With reference 
to the embodiment of Figure 2, the shape recognition logic 216 generates the shape 
recognition match score. In particular, the media asset control logic 214 transmits 

10 shape-related preferences for the user along with the frame of video. The shape 
recognition logic 216 generates this score based on how well the shape-related 
preferences match the shapes in the frame of the video. For example, there may be 
25 different shape-related preferences, which are compared to the shapes in the 
frame. To illustrate, these shape-related preferences may include the faces of 

15 individuals, the shapes involving a player scoring a goal in soccer or basketball, text 
that shows the score of a sporting event, etc. In an embodiment, the shape 
recognition logic 216 performs a comparison of a preference to the shapes in the 
frame based on the catalog and the semantic information associated with the 
preference and the frame. In an embodiment, the shape recognition logic 216 

20 generates an eight bit (0-255) normalized component match score. Accordingly, the 
shape recognition logic 216 generates a relative high match score if likelihood of a 
match between one of the shape-related preferences and the shapes in the frame is 
high. The shape recognition logic 216 outputs the shape recognition match score to 
the media asset control logic 214. Control continues at block 512. 

25 In block 508, a text recognition match score is generated. With reference to 

the embodiment of Figure 2, the text recognition logic 220 generates the text 
recognition match score. In particular, the media asset control logic 214 transmits 
text-related preferences for the user along with the close-captioned text associated 
with the frame. The text recognition logic 220 generates this score based on how 

30 well the text-related preferences match the close-captioned text associated with the 



Atty. Dkt. No.:884.B73USl 



Client Ref.No.: P18180 



frame. For example, there may be 40 different text-related preferences, which are 
compared to the close-captioned text associated with the frame. To illustrate, these 
text-related preferences may include text that is generated in close captioning for a 
program or sequence thereof For example, the text could be the name of a 
5 character in a movie, the name of the movie, the name of the sports announcer, 
sports athlete, etc. In an embodiment, the text recognition logic 220 performs a 
comparison of a preference to the close-captioned text in the frame based on the 
catalog and the semantic information associated with the preference and the frame. 
In an embodiment, the text recognition logic 220 generates an eight bit (0-255) 

10 normalized component match score. Accordingly, the text recognition logic 220 
generates a relative high match score if likelihood of a match between one of the 
text-related preferences and the close-captioned text in the frame is high. The text 
recognition logic 220 outputs the text recognition match score to the media asset 
control logic 214. Control continues at block 512. 

15 In block 509, a texture recognition match score is generated. With reference 

to the embodiment of Figure 2, the texture recognition logic 221 generates the 
texture recognition match score. In particular, the media asset control logic 214 
transmits texture-related preferences for the user along with the frame of video. The 
texture recognition logic 221 generates this score based on how well the texture- 

20 related preferences match the texture in the frame of the video. For example, there 
may be 15 different texture-related preferences, which are compared to the different 
textures in the frame. To illustrate, these shape-related preferences may include the 
texture of a football field, a basketball court, a soccer field, etc. In an embodiment, 
the texture recognition logic 221 performs a comparison of a preference to the 

25 textures in the frame based on the catalog and the semantic information associated 
with the preference and the frame. In an embodiment, the texture recognition logic 
221 generates an eight bit (0-255) normalized component match score. 
Accordingly, the texture recognition logic 221 generates a relative high match score 
if likelihood of a match between one of the texture-related preferences and the 

30 textures in the frame is high. The texture recognition logic 221 outputs the texture 
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recognition match score to the media asset control logic 214. Control continues at 
block 512. 

In block 512, a weighted score is generated. With referenced to the 
embodiment of Figure 2, the media asset control logic 214 generates the weighted 
5 score for this frame. In particular, the media asset control logic 214 generates this 
weighted score based on the voice recognition match score, the shape recognition 
match score, the text recognition match score and the texture recognition match 
score. In an embodiment, the media asset control logic 214 assigns a weight to 
these different component match scores based on the type of programming. For 
10 example, for sports-related programs, the media asset control logic 214 may use a 
weighted combination of voice, shape and text. For home shopping-related 
programs, the media asset control logic 214 may use a weighted combination of 
voice, texture and text. 

Table 1 shown below illustrates one example of the assigned weights for the 
1 5 different component match scores based on the type of programming. 



Programming Type 


Weights per Component Type 


Shape 


Texture 


Voice 


Text 


Hit % 


Sports/Soccer/World-Cup 


0.33 




0.34 


0.33 


100 


Sports/Basketball/NBA 


0.20 




0.40 


0.40 


100 


News/TV/commercials 




0.33 


0.33 


0.34 


75 


Business/home- shopping/j ewelry 




0.33 


0.27 


0.40 


100 



Table 1 



In one embodiment, the media asset control logic 214 determines the type of 
20 program based on the semantic metatdata that is embedded in the media data signal 
being received into the system 100. The media asset control logic 214 multiplies 
the weights by the associated component match score and adds the multiplied values 
to generate the weighted score. Control continues at block 514. 

In block 514, a determination is made of whether the weighted score exceeds 
25 an acceptance threshold. With reference to the embodiment of Figure 2, the media 
asset control logic 214 makes this determination. In an embodiment, the acceptance 
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threshold is a value that may be configurable by the user. Therefore, the user may 
allow for more or less tolerance for the recording of certain unintended video 
sequences. In one embodiment, the acceptance threshold is based on the size of the 
storage medium. For example, if the size of the storage medium 108 is 80 
5 Gigabytes, the acceptance threshold may be less in comparison to a system 108 
wherein the size of the storage medium 108 is 40 Gigabytes. 

In block 516, upon determining that the weighted score does not exceed the 
acceptance threshold, the frame is marked as "rejected." With reference to the 
embodiment of Figure 2, the media asset control logic 214 marks this frame as 

10 "rejected." Accordingly, such frame will not be stored in the media database 224 
for possible subsequent viewing by the user. Control continues at block 520, which 
is described in more detail below. 

In block 518, upon determining that the weighted score is equal to or 
exceeds the acceptance threshold, the frame is marked as "accepted." With 

15 reference to the embodiment of Figure 2, the media asset control logic 214 marks 
this frame as "accepted." Accordingly, such frame will be stored in the media 
database 224 for possible subsequent viewing by the user. Control continues at 
block 520. 

In block 520, a determination is made of whether the end of the frame 
20 workspace has been reached. With reference to the embodiment of Figure 2, the 
media asset control logic 214 makes this determination. In particular, in one 
embodiment, a temporary workspace for a number of frames is allocated within the 
media database 224. Accordingly, the operations of the flow diagram 500 are for a 
given number of frames in this workspace. Therefore, the operations of the flow 
25 diagram 500 may be repeatedly performed on the frames for the given channel to 
which the tuner 202 is tuned. For example, this temporary workspace may be 10 
minutes of video/audio frames. However, embodiments of the invention are not so 
limited. For example, in another embodiment, the operations of the flow diagram 
500 may be performed while the frames are received from the time shift logic 208. 
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In block 522, upon determining that the end of the frame workspace has not 
been reached, the frame sequence is incremented. With reference to the 
embodiment of Figure 2, the media asset control logic 214 increments the frame 
sequence. As described above, in an embodiment, the decoder 206 marks the 
5 frames with timeline annotations, which serve as the frame sequence. The media 
asset control logic 214 increments the frame sequence to allow for the processing of 
the next frame within the frame workspace. Control continues at blocks 506, 507, 
508 and 509, wherein the match scores are generated for the next frame in the frame 
workspace. Therefore, the operations in blocks 506, 507, 508, 509, 512, 514, 516, 

10 518, 520 and 522 continue until all of the frames in the frame workspace have been 
marked with as "rejected" or "accepted." Because the voices, shapes, text and 
texture for frames change slowly over time, a series of consecutive frames (e.g., 2 
minutes of video) are typically marked as "accepted", which are followed by a 
series of frames that are marked as "rejected", etc. For example, if 5000 frames 

15 include the video of a soccer player scoring a goal, which matches one of the 
preferences of the user, the 5000 frames are marked as "accepted." 

In block 524, upon determining that the end of the frame workspace has 
been reached, the start/stop sequences are marked. With reference to the 
embodiment of Figure 2, the sequence composer logic 222 marks the start/stop 

20 sequences. The sequence composer logic 222 marks the start/stop sequences of the 
frames based on the marks of "rejected" and "accepted" for the frames. The 
sequence composer logic 222 marks the start sequence from the first frame that is 
marked as "accepted" that is subsequent to a frame that is marked as "rejected." 
The sequence composer logic 222 marks the stopping point of this sequence for the 

25 last frame that is marked as "accepted" that is prior to a frame that is marked as 
"rejected." Therefore, the sequence composer logic 222 may mark a number of 
different start/stop sequences that are to be stored in the media database 224, which 
may be subsequently viewed by the user. In an embodiment, a start/stop sequence 
continues from one frame workspace to a subsequent frame workspace. Therefore, 
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the sequence composer logic 222 marks start/stop sequences across a number of 
frame workspaces. Control continues at block 526. 

In block 526, the frames in the start/stop sequences are resynchronized. 
With reference to the embodiment of Figure 2, the sequence composer logic 222 
5 resynchronizes the frames in the start/stop sequences. In an embodiment, the 

sequence composer logic 222 resynchronizes by deleting the frames that are not in 
the start/stop sequences (those frames marked as "rejected"). In an embodiment, the 
sequence composer logic 222 defragments the frame workspace by moving the 
start/stop sequences together for approximately continuous storage therein. 

10 Accordingly, this de- fragmentation assists in efficiently usage of the storage in the 
media database 224. In an embodiment, the sequence composer logic 222 transmits 
the resynchronized start/stop sequences to the encoder 210. The encoder 210 
encodes these sequences, prior to storage into the media database 224. The 
operations of the flow diagram 500 are complete. 

15 While the flow diagram 500 illustrates four different component match 

scores for the frames, embodiments of the invention are not so limited, as a lesser or 
greater number of such component match scores may be incorporated into the 
operations of the flow diagram 500. For example, in another embodiment, a 
different component match score related to the colors, motion, etc. in the frame of 

20 video could be generated and incorporated into the weighted score. 

Moreover, the flow diagram 500 may be modified to allow for the 
recording/storage of a program without the commercials that may be embedded 
therein. In particular, the flow diagram 500 illustrates the comparison between 
characteristics in a frame and preferences of the user/viewer. However, in an 

25 embodiment, the characteristics in a frame may be compared to characteristics of 
commercials (similar to blocks 506-509). A weighted score is generated which 
provides an indication of whether the frame is part of a commercial. Accordingly, 
such frames are marked as "rejected", while other frames are marked as "accepted", 
thereby allowing for the storage of the program independent of the commercials. 
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While the characteristics of commercials may be determine based on a 
number of different operations, in one embodiment, the viewer/user may train the 
system 100 by inputting a signal into the I/O logic 124 at the beginning point and 
ending point of commercials while viewing programs. Therefore, the media asset 
5 management logic 104 may process the frames within these marked commercials to 
extract relevant shapes, audio, text, texture, etc. Such extracted data may be stored 
in the storage medium 108. 

In the description, numerous specific details such as logic implementations, 
opcodes, means to specify operands, resource partitioning/sharing/duplication 

10 implementations, types and interrelationships of system components, and logic 
partitioning/integration choices are set forth in order to provide a more thorough 
understanding of the present invention. It will be appreciated, however, by one 
skilled in the art that embodiments of the invention may be practiced without such 
specific details. In other instances, control structures, gate level circuits and full 

15 software instruction sequences have not been shown in detail in order not to obscure 
the embodiments of the invention. Those of ordinary skill in the art, with the 
included descriptions will be able to implement appropriate functionality without 
undue experimentation. 

References in the specification to "one embodiment", "an embodiment", "an 

20 example embodiment", etc., indicate that the embodiment described may include a 
particular feature, structure, or characteristic, but every embodiment may not 
necessarily include the particular feature, structure, or characteristic. Moreover, 
such phrases are not necessarily referring to the same embodiment. Further, when a 
particular feature, structure, or characteristic is described in connection with an 

25 embodiment, it is submitted that it is within the knowledge of one skilled in the art 
to affect such feature, structure, or characteristic in connection with other 
embodiments whether or not explicitly described. 

Embodiments of the invention include features, methods or processes that 
may be embodied within machine-executable instructions provided by a machine- 

30 readable medium. A machine-readable medium includes any mechanism which 
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provides (i.e., stores and/or transmits) information in a form accessible by a 
machine (e.g., a computer, a network device, a personal digital assistant, 
manufacturing tool, any device with a set of one or more processors, etc.). In an 
exemplary embodiment, a machine-readable medium includes volatile and/or non- 
5 volatile media (e.g., read only memory (ROM), random access memory (RAM), 
magnetic disk storage media, optical storage media, flash memory devices, etc.), as 
well as electrical, optical, acoustical or other form of propagated signals (e.g., 
carrier waves, infrared signals, digital signals, etc.)). 

Such instructions are utilized to cause a general or special purpose processor, 

10 programmed with the instructions, to perform methods or processes of the 
embodiments of the invention. Alternatively, the features or operations of 
embodiments of the invention are performed by specific hardware components 
which contain hard-wired logic for performing the operations, or by any 
combination of programmed data processing components and specific hardware 

1 5 components. Embodiments of the invention include software, data processing 
hardware, data processing system-implemented methods, and various processing 
operations, further described herein. 

A number of figures show block diagrams of systems and apparatus for 
selective media storage based on user profiles and preferences, in accordance with 

20 embodiments of the invention. A number of figures show flow diagrams illustrating 
operations for selective media storage based on user profiles and preferences. The 
operations of the flow diagrams will be described with references to the 
systems/apparatus shown in the block diagrams. However, it should be understood 
that the operations of the flow diagrams could be performed by embodiments of 

25 systems and apparatus other than those discussed with reference to the block 

diagrams, and embodiments discussed with reference to the systems/apparatus could 
perform operations different than those discussed with reference to the flow 
diagram. 

In view of the wide variety of permutations to the embodiments described 
30 herein, this detailed description is intended to be illustrative only, and should not be 
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taken as limiting the scope of the invention. To illustrate, while the system 100 
illustrates one tuner 202, in other embodiments, a greater number of tuners may be 
included therein. Accordingly, the system 100 may record parts of two different 
programs that are on different channels at the same time. For example, the system 
5 100 may record highlights of a soccer match on channel 55 using a first tuner, while 
simultaneously, at least in part, recording a movie without the commercials on 
channel 43. What is claimed as the invention, therefore, is all such modifications as 
may come within the scope and spirit of the following claims and equivalents 
thereto. Therefore, the specification and drawings are to be regarded in an 
1 0 illustrative rather than a restrictive sense. 
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